Unsupervised NN approach and PCA for Background – Foreground video segmentation
نویسندگان
چکیده
MPEG-4 based video coding applications require the segmentation of each video image in its principal moving objects to be coded independently from each other. Several techniques of video objects segmentation for coding purposes have been presented in literature; all such segmentation techniques are based on the smart soft-thresholding of the motion fields, the best ones dealing with dense motion fields. Anyway, MPEG-4 based coding structures require a block based (sparse) motion field estimation. The use of block based coding structures, don’t allow fair video objects segmentation for the intrinsic inaccuracy of motion estimate the block based structure of the motion field, specially on moving object border blocks. In this context the segmentation obtained basing only on motion information is inaccurate, but it can be enhanced by the joint use of several information at hand, like color, motion, frame difference, prediction error, texture and so on. In this work a locally connected unsupervised neural network approach is presented, to obtain the segmentation of a moving video object (VO) on a fixed or slow–translating background. The purpose of background-foreground segmentation is here addressed by a split and merge like criterion: the neural network is applied to avail on correlation between several image components such as color, motion vectors and frame difference to simplify each image of the sequence into disjoint regions, thus reducing the dimensionality of the problem. The segmentation is obtained working on sub-spaces the whole subspace spanned by the several image components used to define the image, in a partition tree scheme. The second step consists on a Principal Component Analysis (PCA) to simplify the several clusters obtained in the first phase and obtain a two sets partition of the whole image. As the main information used in this phase deal principally on motion estimates and frame difference the PCA simplification step is likely to produce an image partition close to the searched BF segmentation. Preliminary results of the proposed method applied on “Foreman” video sequence seem promising. Network parameters tuning to make the proposed algorithm feasible is still needed; applications to a MPEG-4 based video coding structure is a future goal we will pursue. A complete neural network structure designed to produce the searched image segmentation, based on results of this preliminary study, is under construction.
منابع مشابه
Robust 2D Moving Object Segmentation and Tracking in Video Sequences
− Unsupervised motion segmentation and tracking in video sequences is a complex task, requiring robust estimation and flexible modeling. The paper presents an unsupervised method of moving object segmentation and tracking in video sequences captured by static cameras. Central to our work is the nonparametric density estimation and the mean shift algorithm for finding local maxima of the probabi...
متن کاملReal-time Compressed-domain Spatiotemporal Video Segmentation
In this paper, a novel algorithm for the real-time, unsupervised segmentation of image sequences in the compressed domain is proposed. The algorithm utilizes the motion information present in the compressed stream in the form of P-frame forward motion vectors, as well as basic color information in the form of DC coefficients present in I-frames. An iterative rejection scheme based on the biline...
متن کاملVOS-GAN: Adversarial Learning of Visual-Temporal Dynamics for Unsupervised Dense Prediction in Videos
Recent GAN-based video generation approaches model videos as the combination of a time-independent scene component and a time-varying motion component, thus factorizing the generation problem into generating background and foreground separately. One of the main limitations of current approaches is that both factors are learned by mapping one source latent space to videos, which complicates the ...
متن کاملOR-PCA with MRF for Robust Foreground Detection in Highly Dynamic Backgrounds
Accurate and efficient foreground detection is an important task in video surveillance system. The task becomes more critical when the background scene shows more variations, such as water surface, waving trees, varying illumination conditions, etc. Recently, Robust Principal Components Analysis (RPCA) shows a very nice framework for moving object detection. The background sequence is modeled b...
متن کاملCapturing video structure with mixture of probabilistic index maps
The ability to segment or separate foreground from background in video images is useful to a number of applications including video compression, human-computer interaction, and object tracking to name a few. In order to generate such segmentation in both a reliable and visually pleasing manner the fusion of both spatial and temporal information is required. This fusion typically requires to pro...
متن کامل